
Collaborative Medical Triage under Uncertainty: A Multi-Agent Dynamic Matching Approach

Cheng, Hongyan, Yu, Chengzhang, Shi, Yanshu, Wang, Chiyue, Liu, Cong, Jin, Zhanpeng

arXiv.org Artificial Intelligence

The post-pandemic surge in healthcare demand, coupled with critical nursing shortages, has placed unprecedented pressure on medical triage systems, necessitating innovative AI-driven solutions. We present a multi-agent interactive intelligent system for medical triage that addresses three fundamental challenges in current AI-based triage systems: inadequate medical specialization leading to misclassification, heterogeneous department structures across healthcare institutions, and inefficient detail-oriented questioning that impedes rapid triage decisions. Our system employs three specialized agents--RecipientAgent, InquirerAgent, and DepartmentAgent--that collaborate through Inquiry Guidance and Classification Guidance mechanisms to transform unstructured patient symptoms into accurate department recommendations. To ensure robust evaluation, we constructed a comprehensive Chinese medical triage dataset from "Ai Ai Yi Medical Network", comprising 3,360 real-world cases spanning 9 primary departments and 62 secondary departments. Experimental results demonstrate that our multi-agent system achieves 89.6% accuracy in primary department classification and 74.3% accuracy in secondary department classification after four rounds of patient interaction. The system's dynamic-matching-based guidance mechanisms enable efficient adaptation to diverse hospital configurations while maintaining high triage accuracy. The resulting multi-agent triage system not only adapts to organizational heterogeneity across healthcare institutions but also ensures clinically sound decision-making.
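The abstract names the three agents but not their internals. The following is a minimal sketch of how such a pipeline could be wired together; the agent names come from the paper, while all internal logic here (keyword matching, the round limit, the department table) is assumed for illustration — the actual system uses LLM-backed agents and guidance mechanisms.

```python
# Hypothetical sketch of the three-agent triage loop. Only the agent names
# are from the abstract; the logic below is a stand-in for LLM-backed agents.
from dataclasses import dataclass, field


@dataclass
class TriageState:
    symptoms: list = field(default_factory=list)
    rounds: int = 0


class RecipientAgent:
    """Turns free-text patient input into structured symptom terms."""

    def receive(self, state: TriageState, text: str) -> None:
        state.symptoms.extend(t.strip().lower() for t in text.split(",") if t.strip())


class InquirerAgent:
    """Asks follow-ups until there is enough signal or a round limit is hit
    (the paper reports accuracy after four rounds of interaction)."""

    MAX_ROUNDS = 4

    def next_question(self, state: TriageState):
        if state.rounds >= self.MAX_ROUNDS or len(state.symptoms) >= 3:
            return None  # enough information gathered
        state.rounds += 1
        return "Can you describe any other symptoms?"


class DepartmentAgent:
    """Scores symptoms against a hospital-specific department table; swapping
    this table is what lets the system adapt to heterogeneous departments."""

    def __init__(self, departments: dict):
        self.departments = departments

    def recommend(self, state: TriageState) -> str:
        scores = {d: len(kw & set(state.symptoms)) for d, kw in self.departments.items()}
        return max(scores, key=scores.get)


# Hypothetical department table; a real deployment would load the
# institution's own primary/secondary department hierarchy.
departments = {
    "cardiology": {"chest pain", "palpitations"},
    "gastroenterology": {"abdominal pain", "nausea"},
}
state = TriageState()
RecipientAgent().receive(state, "chest pain, palpitations")
recommendation = DepartmentAgent(departments).recommend(state)
print(recommendation)  # cardiology
```

The key design point mirrored here is that the department structure is data, not code, so the same agents can serve hospitals with different department hierarchies.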


Zero-shot Performance of Generative AI in Brazilian Portuguese Medical Exam

Truyts, Cesar Augusto Madid, Rabelo, Amanda Gomes, de Souza, Gabriel Mesquita, Lages, Daniel Scaldaferri, Pereira, Adriano Jose, Flato, Uri Adrian Prync, Reis, Eduardo Pontes dos, Vieira, Joaquim Edson, Silveira, Paulo Sergio Panse, Junior, Edson Amaro

arXiv.org Artificial Intelligence

Artificial intelligence (AI) has shown the potential to revolutionize healthcare by improving diagnostic accuracy, optimizing workflows, and personalizing treatment plans. Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) have achieved notable advancements in natural language processing and medical applications. However, the evaluation of these models has focused predominantly on the English language, leading to potential biases in their performance across different languages. This study investigates the capability of six LLMs (GPT-4.0 Turbo, LLaMA-3-8B, LLaMA-3-70B, Mixtral 8x7B Instruct, Titan Text G1-Express, and Command R+) and four MLLMs (Claude-3.5-Sonnet, Claude-3-Opus, Claude-3-Sonnet, and Claude-3-Haiku) to answer questions written in Brazilian Portuguese from the medical residency entrance exam of the Hospital das Clínicas da Faculdade de Medicina da Universidade de São Paulo (HCFMUSP) - the largest health complex in South America. The performance of the models was benchmarked against human candidates, analyzing accuracy, processing time, and coherence of the generated explanations. The results show that while some models, particularly Claude-3.5-Sonnet and Claude-3-Opus, achieved accuracy levels comparable to human candidates, performance gaps persist, particularly in multimodal questions requiring image interpretation. Furthermore, the study highlights language disparities, emphasizing the need for further fine-tuning and dataset augmentation for non-English medical AI applications. Our findings reinforce the importance of evaluating generative AI in various linguistic and clinical settings to ensure fair and reliable deployment in healthcare. Future research should explore improved training methodologies, stronger multimodal reasoning, and real-world clinical integration of AI-driven medical assistance.


Comparisons between a Large Language Model-based Real-Time Compound Diagnostic Medical AI Interface and Physicians for Common Internal Medicine Cases using Simulated Patients

Park, Hyungjun, Woo, Chang-Yun, Lim, Seungjo, Lim, Seunghwan, Kwak, Keunho, Jeong, Ju Young, Suh, Chong Hyun

arXiv.org Artificial Intelligence

Objective To develop an LLM-based real-time compound diagnostic medical AI interface and to conduct a clinical trial comparing this interface with physicians for common internal medicine cases based on United States Medical Licensing Examination (USMLE) Step 2 Clinical Skills (CS)-style exams. Methods A nonrandomized clinical trial was conducted on August 20, 2024. We recruited one general physician, two internal medicine residents (2nd and 3rd year), and five simulated patients. The clinical vignettes were adapted from the USMLE Step 2 CS-style exams. We developed 10 representative internal medicine cases based on actual patients and included information available on initial diagnostic evaluation. The primary outcome was the accuracy of the first differential diagnosis. Repeatability was evaluated based on the proportion of agreement. Results The accuracy of the physicians' first differential diagnosis ranged from 50% to 70%, whereas the real-time compound diagnostic medical AI interface achieved an accuracy of 80%. The proportion of agreement for the first differential diagnosis was 0.7. The accuracy of the first and second differential diagnoses ranged from 70% to 90% for physicians, whereas the AI interface achieved an accuracy rate of 100%. The average time for the AI interface (557 sec) was 44.6% shorter than that of the physicians (1006 sec). The AI interface ($0.08) also reduced costs by 98.1% compared to the physicians' average ($4.2). Patient satisfaction scores ranged from 4.2 to 4.3 for care by physicians and were 3.9 for the AI interface. Conclusion An LLM-based real-time compound diagnostic medical AI interface demonstrated diagnostic accuracy and patient satisfaction comparable to those of a physician, while requiring less time and lower costs. These findings suggest that AI interfaces may have the potential to assist primary care consultations for common internal medicine cases.
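The reported reductions follow directly from the figures in the abstract; a quick back-of-the-envelope check confirms them:

```python
# Verify the reported time and cost reductions from the abstract's figures.
ai_time, md_time = 557, 1006   # seconds per consultation
ai_cost, md_cost = 0.08, 4.2   # US dollars per consultation

time_reduction = (md_time - ai_time) / md_time * 100  # percent shorter
cost_reduction = (md_cost - ai_cost) / md_cost * 100  # percent cheaper

print(f"{time_reduction:.1f}% faster, {cost_reduction:.1f}% cheaper")
# 44.6% faster, 98.1% cheaper
```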


Evaluation of Bias Towards Medical Professionals in Large Language Models

Chen, Xi, Xu, Yang, You, MingKe, Wang, Li, Liu, WeiZhi, Li, Jian

arXiv.org Artificial Intelligence

This study evaluates whether large language models (LLMs) exhibit biases towards medical professionals. Fictitious candidate resumes were created to control for identity factors while maintaining consistent qualifications. Three LLMs (GPT-4, Claude-3-haiku, and Mistral-Large) were tested using a standardized prompt to evaluate resumes for specific residency programs. Explicit bias was tested by changing gender and race information, while implicit bias was tested by changing names while hiding race and gender. Physician data from the Association of American Medical Colleges was used to compare with real-world demographics. 900,000 resumes were evaluated. All LLMs exhibited significant gender and racial biases across medical specialties. Gender preferences varied, favoring male candidates in surgery and orthopedics, while preferring females in dermatology, family medicine, obstetrics and gynecology, pediatrics, and psychiatry. Claude-3 and Mistral-Large generally favored Asian candidates, while GPT-4 preferred Black and Hispanic candidates in several specialties. Tests revealed strong preferences towards Hispanic females and Asian males in various specialties. Compared to real-world data, LLMs consistently chose higher proportions of female and underrepresented racial candidates than their actual representation in the medical workforce. GPT-4, Claude-3, and Mistral-Large showed significant gender and racial biases when evaluating medical professionals for residency selection. These findings highlight the potential for LLMs to perpetuate biases and compromise healthcare workforce diversity if used without proper bias mitigation strategies.
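The core of the protocol described above is a counterfactual design: hold qualifications fixed, vary only the identity signal, and compare model judgments across variants. The sketch below illustrates that design under stated assumptions — the resume text, scoring function, and identity categories are placeholders, and the `evaluate` stub stands in for calls to the actual models tested (GPT-4, Claude-3-haiku, Mistral-Large).

```python
# Sketch of a counterfactual resume audit: identical qualifications,
# varied identity signal. All concrete values here are hypothetical.
from itertools import product

BASE_RESUME = "MD, 3 publications, honors in clinical rotations, strong letters."
GENDERS = ["male", "female"]
RACES = ["Asian", "Black", "Hispanic", "White"]


def make_resume(gender: str, race: str) -> str:
    # Explicit-bias condition: identity stated outright.
    # (The study's implicit condition instead varies candidate names
    # while hiding gender and race.)
    return f"Gender: {gender}. Race: {race}. {BASE_RESUME}"


def evaluate(resume: str) -> float:
    # Placeholder for an LLM call returning a suitability score for a
    # given residency program. An unbiased model would return the same
    # score for every identity variant of the same resume.
    return 0.5


scores = {(g, r): evaluate(make_resume(g, r)) for g, r in product(GENDERS, RACES)}

# Bias shows up as systematic score gaps across identity variants.
gap = max(scores.values()) - min(scores.values())
print(f"variants evaluated: {len(scores)}, max score gap: {gap:.2f}")
```

With a constant stub the gap is zero by construction; in the study, repeating this comparison at scale (900,000 resumes) surfaced significant gaps across specialties.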


'The Last of Us' tells a new but familiar queer love story

Washington Post - Technology News

But however revolutionary their deaths might be for the universe of "The Last of Us," they still fall into well-worn gay death tropes. It seems that Bill is older than Frank, but Frank succumbs to an unspecified illness and ends up infirm, which ultimately prompts his suicide. If you grew up queer in the 80s and 90s, the image of one gay man pushing another in a wheelchair might look fiercely familiar from the early days of the AIDS crisis and the storytelling that came out of it. Many cis gay men of my generation believed this kind of death was inevitable, that they would die tended to by a lover or they would be the widower left behind. Bill rebels against this trope by dying alongside Frank, but as I watched (and cried) as Bill wheeled Frank around their house and handed him his pills, I thought of how many times I had seen this scene in other movies and television. I wondered why the show's creators chose to have Frank sicken to lead to Bill and Frank's deaths when one or both of their ages could have been the inciting factor.


How Data and Smart Technology Are Helping Hospitalists

#artificialintelligence

The increasing complexity of patient care, difficulties with time management, and the burden of administrative tasks while complying with regulations are a few of the overarching challenges that come with the job. Fortunately, big data and smart technology are helping hospitalists overcome these issues. Medical billing, for one, is notoriously error-prone: some estimates suggest that upward of 80% of medical bills contain errors.


The top 100 new technology innovations of 2022

#artificialintelligence

On a cloudy Christmas morning last year, a rocket carrying the most powerful space telescope ever built blasted off from a launchpad in French Guiana. After reaching its destination in space about a month later, the James Webb Space Telescope (JWST) began sending back sparkling presents to humanity--jaw-dropping images that are revealing our universe in stunning new ways. Every year since 1988, Popular Science has highlighted the innovations that make living on Earth even a tiny bit better. And this year--our 35th--has been remarkable, thanks to the successful deployment of the JWST, which earned our highest honor as the Innovation of the Year. But it's just one item out of the 100 stellar technological accomplishments our editors have selected to recognize. The list below represents months of research, testing, discussion, and debate. It celebrates exciting inventions that are improving our lives in ways both big and small. These technologies and discoveries are teaching us about the ...


This AI tool predicts whether COVID patients will live or die

#artificialintelligence

A tool has been developed to help healthcare professionals identify hospitalised patients most at risk of dying from COVID-19 using artificial intelligence (AI). The algorithm could help doctors to direct critical care resources to those in most immediate need, which the developers of the AI tool say could be especially valuable to resource-limited countries. And with no end in sight for the coronavirus pandemic, with new variants leading to fresh waves of sickness and hospitalisation, the scientists behind the tool say there is a need for generalised tools like this which can be easily rolled out. To develop the tool, scientists used biochemical data from routine blood samples taken from nearly 30,000 patients hospitalised in over 150 hospitals in Spain, the US, Honduras, Bolivia and Argentina between March 2020 and February 2022. Taking blood from so many patients meant the team were able to capture data from people with different immune statuses – vaccinated, unvaccinated and those with natural immunity – and from people infected with every variant of COVID-19.


Bayesian Kernelised Test of (In)dependence with Mixed-type Variables

Benavoli, Alessio, de Campos, Cassio

arXiv.org Machine Learning

A fundamental task in AI is to assess (in)dependence between mixed-type variables (text, image, sound). We propose a Bayesian kernelised correlation test of (in)dependence using a Dirichlet process model. The new measure of (in)dependence allows us to answer some fundamental questions: Based on data, are (mixed-type) variables independent? How likely is dependence/independence to hold? How high is the probability that two mixed-type variables are more than just weakly dependent? We analyse the theoretical properties of the approach and present algorithms for fast computation. We empirically demonstrate the effectiveness of the proposed method by analysing its performance and by comparing it with other frequentist and Bayesian approaches on a range of datasets and tasks with mixed-type variables.
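The paper's test is Bayesian (built on a Dirichlet process model), which this listing does not detail. As a simpler frequentist point of comparison — not the authors' method — the sketch below computes HSIC, a standard kernelised dependence measure whose population value is zero iff the variables are independent under a characteristic kernel such as the RBF:

```python
# Frequentist kernelised dependence measure (HSIC), shown only as a
# simpler analogue of the kernelised (in)dependence testing the paper
# treats in a Bayesian framework.
import numpy as np


def rbf_kernel(x: np.ndarray, sigma: float = 1.0) -> np.ndarray:
    # Gram matrix K[i, j] = exp(-(x_i - x_j)^2 / (2 sigma^2))
    d2 = (x[:, None] - x[None, :]) ** 2
    return np.exp(-d2 / (2 * sigma**2))


def hsic(x: np.ndarray, y: np.ndarray) -> float:
    # Biased HSIC estimator: trace(K H L H) / (n - 1)^2,
    # where H centres the Gram matrices.
    n = len(x)
    K, L = rbf_kernel(x), rbf_kernel(y)
    H = np.eye(n) - np.ones((n, n)) / n
    return float(np.trace(K @ H @ L @ H) / (n - 1) ** 2)


rng = np.random.default_rng(0)
x = rng.normal(size=200)
dep = hsic(x, x + 0.1 * rng.normal(size=200))  # strongly dependent pair
ind = hsic(x, rng.normal(size=200))            # independent pair
print(dep > ind)  # True: dependence yields a markedly larger statistic
```

Handling mixed-type variables, as the paper does, would amount to choosing an appropriate kernel per data type; the Bayesian construction additionally yields posterior probabilities of dependence rather than a single statistic.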


4 Tips to Improve Your Statistical Literacy

#artificialintelligence

Statistical literacy (assessing statistical statements, arguments and associations) is extremely important for producing and interpreting results from data analysis, yet it usually isn't a part of mainstream statistics education [1]. From the correlation-causation error to immortal time bias, there are many ways to invalidate your results. You can lessen the odds by following a few good practices. When you design your analysis, make sure you're asking the right question. This isn't always easy, as the German Federal Ministry of the Interior, Building and Home Affairs found out after publishing a 2018 press release concerning the "successful" use of facial recognition technology at train stations [2].